
Deep Learning Compiler

  1. Why: to evaluate and optimize the entire network holistically, including cross-layer optimizations that an interpreter cannot perform
  2. What: a compiler that translates a computational graph into target-specific kernels that can be executed on the target platform
  3. How:
    1. Graph Level Optimizations
    2. Layout Optimization
    3. Kernel Selection and Kernel Generation
    4. Scheduling
    5. Tensor Allocation
  4. How good: out-of-the-box high performance
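
Two of the steps listed under "How" can be sketched in a few lines of Python. This is a toy illustration only, not how Myelin, TensorRT, XLA, PyTorch JIT, or TVM implement them. The `Node` class, the `fuse_relu` and `allocate` functions, and the `FUSABLE` set are hypothetical names invented for this sketch, and the allocator assumes every tensor has the same size.

```python
from collections import Counter
from dataclasses import dataclass

FUSABLE = {"conv", "matmul"}  # assumption: producers that can absorb a relu


@dataclass
class Node:
    name: str          # name of the tensor this node produces
    op: str            # operator, e.g. "conv" or "relu"
    inputs: list[str]  # names of the tensors it reads


def fuse_relu(nodes):
    """Graph-level optimization: fold a relu into its producer.

    Expects nodes in topological order. A producer is fused only when the
    relu is its sole consumer, so no other node loses its input.
    """
    uses = Counter(i for n in nodes for i in n.inputs)
    result, index = [], {}  # index: tensor name -> position in result
    for n in nodes:
        if n.op == "relu" and n.inputs[0] in index:
            pos = index[n.inputs[0]]
            prod = result[pos]
            if prod.op in FUSABLE and uses[prod.name] == 1:
                # The fused node keeps the relu's output name, so
                # downstream consumers stay valid.
                result[pos] = Node(n.name, prod.op + "_relu", prod.inputs)
                del index[prod.name]
                index[n.name] = pos
                continue
        index[n.name] = len(result)
        result.append(n)
    return result


def allocate(nodes, graph_outputs=()):
    """Tensor allocation: reuse a buffer once its tensor is dead.

    Assumes equally sized tensors; a real allocator also tracks sizes.
    """
    last_use = {}
    for step, n in enumerate(nodes):
        for i in n.inputs:
            last_use[i] = step
    free, assignment, next_id = [], {}, 0
    for step, n in enumerate(nodes):
        # Pick the output buffer before releasing inputs, so a node never
        # writes into a buffer it is still reading.
        if free:
            assignment[n.name] = free.pop()
        else:
            assignment[n.name] = next_id
            next_id += 1
        for i in dict.fromkeys(n.inputs):  # unique inputs, stable order
            if i in assignment and last_use[i] == step and i not in graph_outputs:
                free.append(assignment[i])
    return assignment


graph = [
    Node("c1", "conv", ["x", "w1"]),
    Node("r1", "relu", ["c1"]),
    Node("c2", "conv", ["r1", "w2"]),
    Node("r2", "relu", ["c2"]),
    Node("y", "matmul", ["r2", "w3"]),
]
fused = fuse_relu(graph)
buffers = allocate(fused, graph_outputs={"y"})
print([n.op for n in fused])
print(buffers)
```

Fusion here shrinks the graph from five nodes to three, and the allocator places the three remaining intermediate tensors in two buffers. Both passes need to see more than one layer at a time, which is the cross-layer view the "Why" item refers to.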

Examples

  • Myelin
  • TensorRT
  • XLA
  • PyTorch JIT
  • TVM

Related content

|        | Traditional Compiler         | DLC                 |
|--------|------------------------------|---------------------|
| Input  | High-level languages         | Computational graph |
| Output | Low-level languages like asm | Kernels             |
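
The DLC column of the comparison (computational graph in, kernels out) can be illustrated with a small lowering sketch. Everything here is an assumption made for illustration: the `KERNELS` registry, the kernel names in it, and the `lower` function are invented and do not correspond to any real library.

```python
# Hypothetical kernel registry: (operator, target) -> kernel name.
KERNELS = {
    ("conv", "cpu"): "conv_im2col_avx2",
    ("conv", "gpu"): "conv_implicit_gemm",
    ("relu", "cpu"): "relu_vectorized",
    ("relu", "gpu"): "relu_elementwise",
}


def lower(graph, target):
    """Map each (output, op, inputs) node to a kernel call for `target`."""
    program = []
    for out, op, inputs in graph:
        kernel = KERNELS.get((op, target))
        if kernel is None:
            raise NotImplementedError(f"no {target} kernel for op {op!r}")
        program.append((kernel, inputs, out))
    return program


# The same computational graph lowers to different kernels per target.
graph = [("c1", "conv", ["x", "w"]), ("y", "relu", ["c1"])]
cpu_program = lower(graph, "cpu")
gpu_program = lower(graph, "gpu")
print(cpu_program)
print(gpu_program)
```

A table lookup is the simplest form of kernel selection. A real compiler may instead generate the kernel, or choose among several candidates based on tensor shapes and layouts.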

References

DLC Overview Slide